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Abstract 

Digital fingerprinting is a technique that consists of inserting the ID of an authorized user in the digital content that he 
requests. This technique has been mainly used to trace back pirate copies of multimedia content such as images, audio, and 
video. This study proposes the use of state-of-the-art digital fingerprinting techniques in the context of restricted 
distribution of digital documents. In particular, the system proposed by Kuribayashi for multimedia content is investigated. 
Extensive simulations show the robustness of the proposed system against average collusion attack. Perceptual 
transparency of the fingerprinted documents is also studied. Moreover, by using an efficient Fast Fourier Transform core and 
standard computer machines it is shown that the proposed system is suitable for real-world scenarios. 
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Introduction 

The adoption of information systems is changing the way 
organizations work, allowing the automation of their processes and 
thus making them more efficient. One of the recurring processes of 
these organizations is the management of the document's lifecycle 
which is the workflow that defines how users interact with the 
documents from their creation until their usage in other 
organization's processes. Currently, many organizations have 
digitized their physical documents to automate the document's 
lifecycle which depends on the type of information contained in 
the digital documents, but has a common flow [1-3]: 

1 . Creation: A user generates a digital document. 

2. Review: The digital document is evaluated. Then, it is 
determined if the digital document is ready to be released. 

3. Storage: The digital document is stored in a repository of the 
organization. 

4. Distribution: The digital document can be accessed by 
authorized users. 

5. Usage: Authorized users access and manipulate the digital 
document. 

In this lifecycle, digital documents with sensible information 
must be available only for authorized users. Cryptography and 
other security techniques can be used to protect documents during 
their storage and distribution, but once an authorized user obtains 
a clear copy document it has no more protection. The authorized 
user could act dishonestly, by distributing the document to 
unauthorized entities. The document copy distributed illegally is 
known as pirate document and the user that distributes it is named 
traitor. The problem of pirate documents can continue if it is not 
possible to identify the traitor [4-7]. To solve this problem, 



invisible watermarking techniques can be used to insert the ID of 
the user in the document requested. If a pirate copy is detected, it 
would be possible to identify the traitor by detecting the user's ID, 
which was inserted when the document was distributed to that 
user. This way of using the watermarking techniques is known as 
digital fingerprinting. 

Digital fingerprinting was originally used to face the illegal 
distribution of multimedia content such as images [8], audio [9], 
and video [10], but this technique has been also used to protect 
digital documents [11], persuading users not to distribute pirate 
copies and detecting the users who do it. In this study, 
"fingerprinting" will be used as a synonym for "digital 
fingerprinting" and "fingerprint" will be used to refer to the 
unique ID of each user. 

For the effectiveness of fingerprinting techniques, it is necessary 
to satisfy two main properties: 

• Perceptual transparency (namely unobtrusiveness, invisibility, 
or imperceptibility): The original document and its finger- 
printed copy must be identical to the user perception. This is 
measured using the natural language understanding: If the 
document is entirely legible after the fingerprint insertion, this 
property is achieved [12,13], 

• Robustness: It is the capacity of user's IDs to survive 
intentional, and unintentional attacks after being inserted into 
the document. If the fingerprint is removed or destroyed, the 
value of the digital document is lost [8] . 

In an organizational environment, an attack that is prone to 
happen is a collusion attack. This attack occurs when a set of 
traitors obtain a copy of a document, each copy having its own 
fingerprint. Traitors can perform an average of the copies to 
generate a new pirate document destroying their fingerprints in 
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Table 1. Configuration parameters for fingerprint insertion. 







Parameter 


Initial value 


Increment 


Maximum value 


ft, 


50,000 


50,000 


1,200,000 


ft 


50,000 


50,000 


1,200,000 


L 


50,000 


50,000 


1,200 000 




1/6 


1/6 


5/6 
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the process. Traitors who perform a collusion attack are named 
colluders. 

Related Work 

Different watermarking insertion techniques have been pro- 
posed for digital documents that can be used in fingerprinting. 
These techniques can be syntactic techniques, semantic tech- 
niques, and image-based techniques [12,14]. In this study, digital 
documents are represented as images; hence, from these three 
techniques, image-based techniques are chosen. 

The first techniques for watermarking on digital documents 
represented as images were proposed by Brassil et al. [15] and Low 
et al. [16]. They consist of inserting watermarks by shifting the text- 
lines in a vertical way allowing a bit encode per line. To detect the 
watermark, the distance between the lines is measured. These 
techniques only work for formatted documents and the inserted 
watermark is easy to remove by an averaging collusion attack. In 



[17,18], techniques based on horizontally shifting of words in the 
document were proposed. However, these techniques can only be 
applied to documents that have variable spacing between adjacent 
words. The watermark detection is only possible when having the 
knowledge of the space between words in the original document. 
Other techniques based on the distance between lines, words, or 
characters have been proposed in [19-25], and most of them have 
been proved to be robust against indirect attacks such as copying, 
printing, and scanning. Also, specific schemes have been proposed 
to face these problems [26,27]. However, the robustness of these 
techniques against collusion attacks is not reported. 

Spread spectrum techniques for insertion have been widely used 
for natural images because they are robust for a wide range of 
attacks, including collusion attacks. Cox et al. [8] proposed the first 
watermarking technique using spread spectrum. In this technique, 
the user's fingerprint is represented as a spread spectrum sequence 
that is inserted in the most significant frequency regions of the 
image, because the less significant regions tend to be discarded 
when applying filters or other techniques of image processing. 
When the embedded sequence is extracted, it is necessary to 
establish a correlation with all the known user sequences to detect 
the traitor. This strategy increases the detection time linearly, and 
under a collusion attack it considers that all the users are likely to 
collude, which is not necessarily true. Despite the spread spectrum 
being used in [28] and [29] as an insertion technique for digital 
documents, its resistance to collusion attacks has not been 
reported. 

Wang et al. [30] have considered that traitors are more likely to 
collude with users who share common characteristics such as social 




x10 



Figure 1. Distortion generated in digital documents due to fingerprint insertion varying the energy assigned to users (ft,) and 
groups!/; ,), with fixed values of fingerprint length (L) and insertion position (/',,). Low PSNR values implies high distortion. 
doi:1 0.1 371 /journal.pone.0081 976.g001 
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Figure 2. Similarity between an original document and its fingerprinted copy as the energy assigned to users (/>'„) and groups!/; ,) 
vary, with fixed values of fingerprint length (L) and insertion position (/'„). High SSIM Index values implies a high similarity. 
doi:1 0.1 371 /journal.pone.0081 976.g002 



circumstances or geographic location. Under this assumption, a 
hierarchical fingerprinting technique based on spread spectrum 
has been proposed. Using this technique, users are assigned to 



Table 2. Perception of respondents to fingerprinted 
documents with different PSNR and SSIM Index values. 





Metrics 




Typeofanswer 






PSNR 


SSIMIndex 


1 


2 


3 


4 


30.28dB 


0.996 


52 


41 


7 


0 


28.1 IdB 


0.991 


15 


69 


16 


0 


26.03dB 


0.989 


14 


74 


12 


0 


24.21 dB 


0.978 


47 


45 


3 


5 


22.16dB 


0.975 


43 


55 


1 


1 


20.16dB 


0.953 


21 


64 


13 


2 


18.08dB 


0.933 


21 


58 


16 


5 


16.68dB 


0.922 


7 


79 


13 


1 


14.21dB 


0.877 


12 


73 


13 


2 


12.32dB 


0.857 


4 


55 


39 


2 


10.88dB 


0.783 


0 


36 


62 


2 


8.85dB 


0.669 


4 


7 


61 


28 


6.00dB 


0.500 


4 


6 


34 


56 


4.54dB 


0.347 


2 


6 


13 


79 
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groups, and the fingerprint is generated from the user ID and its 
group ID. At the detection stage, first, the group of colluders is 
identified and then the users who belong to that group are 
identified. This reduces the computational cost along with the 
probability of false-positive detection. 

Kuribayashi proposed in [31] a hierarchical fingerprinting 
scheme based on Code Division Multiple Access (CDMA). In this 
scheme, users are organized in groups, and the user's fingerprint is 
represented by a spread spectrum sequence, one for the user ID 
and another for the group ID. These sequences are orthogonal 
because they are Discrete Cosine Transform (DCT) basis vectors 
modulated by a pseudorandom sequence (PN) of 1 and — 1 
values, allowing retention of orthogonality. 

The spread spectrum sequence for a group i is generated from a 
vector V of length L with all entries equal to 0, adding an amount 
of energy ft g to the entry at position i. Then, the Inverse DCT 
(IDCT) is applied to V to obtain the i'-th basis function of the 
DCT. Finally, V is modulated by a sequence PN generated from a 
secret key s, which provides security to the scheme because only 
the one who knows that key is able to detect groups. The spread 
spectrum sequence Wt that is generated for the !-th group is 
expressed by Equation 1 . Each component in the spread spectrum 
sequence for the group ID can be assigned to a group; therefore, 
the total amount of groups supported is L. 

Wi = PN(s)®IDCT(i,P g ) (1) 
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Figure 3. Perception of 100 respondents to fingerprinted digital documents with different PSNR values. 

doi:10.1371/journal.pone.0081976.g003 



Generation of the spread spectrum sequence from an user j 
belonging to a group i is performed similar to the sequence of the 
group, with the difference that the PN seed is given by the group 
ID. The spread spectrum sequence assigned to the j-th user is 
computed according to Equation 2. 

Wj=PN(i)®IDCTUA) (2) 

Using the group ID as the seed of PN, a link is stablished 
between the group and the user. As the number of groups and 
users per group is L, the total amount of users supported is L 2 . 
With the spread spectrum sequences of user and group, the 
fingerprint that represents a user is generated from Equation 3 and 
the total energy of the fingerprint is obtained from Equation 4. 

W i j=W i +W j (3) 

? = $ + & (4) 

The generated fingerprint is inserted in the frequency compo- 
nents of the host image. The starting point of insertion denoted as 
P w is selected from the low and middle frequencies. When a pirate 
document is detected, a sequence Wq is extracted from its 
frequency components starting from the point P w , in a nonblinded 
fashion. From Wtj, it is possible to obtain a detection sequence d g 



for group IDs, and a detection sequence d u to detect user IDs. 
Detection is performed using two thresholds, one for the group 
(T g ) and another for the user (T u ). Upon detecting the presence of 
a group i, it is possible to detect the users belonging to that group. 
However, if a user belongs to the group i+ 1, the user will not be 
detected until detection is performed for that group and the spread 
spectrum sequences of its users in the image are examined. The 
system reported in [31] has proved be faster than the previous 
work as its fingerprint detection strategy is carried out using fast 
DCT algorithms. 

In this study, the performance of the scheme presented in [31] is 
investigated for digital document distribution applications. Nu- 
merous experiments are performed to evaluate the properties of 
perceptual transparency and robustness. The outline of this paper 
is as follows: First, experimental results and discussion are 
presented. In the Methods and Materials section, details of the 
insertion, detection and determination of thresholds are provided, 
and the Fast DCT along with the PSNR and SSIM Index metrics 
used to evaluate perceptual transparency are recalled. Finally, 
conclusions are provided. 

Results and Discussion 

The proposed fingerprinting system implementation must satisfy 
perceptual transparency and robustness under averaging collusion 
attacks in digital documents. To achieve that goal, adequate values 
for the energy of user (/?„), energy of group ()8A fingerprint length 
(L), and insertion position (P w ) are determined through experi- 
mentation. First, values that satisfy perceptual transparency are 
determined. Then, their best configuration is identified for 
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Table 3. Configurations of /}„, /? and L that satisfy 
perceptual transparency for a fixed P u ,. 





Configuration 


ft, 


ftr 


L 


1 


200,000 


50,000 


50,000 


2 


200,000 


100,000 


50,000 


3 


1 50,000 


1 50,000 


50,000 


4 


100,000 


200,000 


50,000 


5 


50,000 


200,000 


50,000 


6 


200,000 


50,000 


100,000 


7 


200,000 


100,000 


1 00,000 


8 


1 50,000 


1 50,000 


1 00,000 


9 


1 00,000 


200,000 


1 00,000 


10 


50,000 


200,000 


1 00,000 


11 


200,000 


50,000 


1 50,000 


12 


200,000 


100,000 


1 50,000 


13 


1 50,000 


1 50,000 


1 50,000 


14 


100,000 


200,000 


1 50,000 


15 


50,000 


200,000 


1 50,000 


16 


200,000 


50,000 


200,000 


17 


200,000 


100,000 


200,000 


18 


1 50,000 


1 50,000 


200,000 


19 


1 00,000 


200,000 


200,000 


20 


50,000 


200,000 


200,000 


21 


200,000 


50,000 


250,000 


22 


200,000 


100,000 


250,000 


23 


1 50,000 


1 50,000 


250,000 


24 


100,000 


200,000 


250,000 


25 


50,000 


200,000 


250,000 


26 


200,000 


50,000 


300,000 


27 


200,000 


100,000 


300,000 


28 


1 50,000 


1 50,000 


300,000 


29 


1 00,000 


200,000 


300,000 


30 


50,000 


200,000 


300,000 


31 


200,000 


50,000 


350,000 


32 


200,000 


100,000 


350,000 


33 


1 50,000 


1 50,000 


350,000 


34 


1 00,000 


200,000 


350,000 


35 


50,000 


200,000 


350,000 
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maximum colluder detection. Simulations of collusion attacks are 
performed, generating pirate copies of digital documents in TIFF 
(lossless) and JPEG (lossy) formats. For the experiments, the input 
documents have been selected from a set of 1000 different digital 
documents in a JPEG format with average dimensions of 
1900x2700 pixels, these documents have been obtained from 
[32]. Statistical significance is achieved for this sample size as 
random /-tests results (for significance level equal to 0.05) showed a 
p-value about 9.9514 x 10~ 209 for the biggest. 

Perceptual Transparency 

To find values for /?„, /?„, L, and P w that satisfy perceptual 
transparency, these values were combined as defined in Table 1 to 
generate fingerprints, which were inserted in a set of 1000 digital 



documents. Then, the perceptual transparency was evaluated by 
using PSNR and SSIM Index as metrics (these metrics are detailed 
in the Methods and Materials section). It was found that ji u and fi g 
are the factors that affect the perceptual transparency more 
negatively as their values increase, as shown in Figure 1. Also, it 
was noticed that lower values of P w slightly increase the PSNR 
value, whereas when L increases, the PSNR value is slightly 
reduced. By fixing the values of P„. and L and varying the values 
of $ u and P g , it can be observed in Figures 1 and 2 that the 
obtained values of PSNR and SSIM Index are correlated. This is 
consistent with the findings of Hore and Ziou [33], 

Inquest Evaluation 

As the perceptual transparency for digital documents is achieved 
by preserving the legibility of the text, an inquest was applied to 
100 respondents to determine the lowest PSNR values that 
satisfied legibility of digital documents. The inquest consisted of 
the evaluation of a subset having fifteen digital documents with a 
PSNR value in the range of 4-30 dB. The possible answers 
available for the respondents were the following: 

1. 1 do not perceive image distortion. 

2. 1 perceive image distortion but the text is easily readable. 

3. I perceive image distortion and the text is hardly readable. 

4. The text is not readable. 

The results of the inquest are shown in Table 2. These results 
are plotted in Figure 3 that shows the change in the perception of 
the respondents while the PSNR value in the fingerprinted 
documents decreases. Most of the respondents considered that 
digital fingerprinted documents with PSNR values greater than 
14 dB and SSIM Index greater than 0.887 are legibles; otherwise, 
those documents were considered as nonlegibles. Therefore, 
configurations of /?„, /3 g , L, and P K that generate fingerprinted 
documents with PSNR and SSIM Index equal to or greater than 
these values satisfy perceptual transparency. 

Collusion Attack on Digital Documents in TIFF Format 

From the generated set of /?„, /L, L, and P w , configurations that 
generate fingerprinted documents were selected with a PSNR 
value of 16 dB to satisfy perceptual transparency. The selected 
values of parameters of /?„, /L, and L are shown in Table 3. As the 
best value for P w is the lowest one (1/6), this value was selected as 
the most appropriate and was fixed for the next simulations. 

Robustness Factors 

In Table 3, there are only five configurations of fi u and ft g for 
each value of L with a PSNR of 16 dB. To determine the values of 
/?„ and f> g that allow the highest colluder detection probability, the 
first five configurations of /?„ and p g were chosen with L = 50,000. 
Then, averaging collusion attacks were simulated over 50 digital 
documents, from 2 colluders to 300 colluders that belong to the 
same group using the selected values. The fingerprinted 
documents were generated in the lossless format TIFF with a 
RGB color scheme. 

The results of the simulation are plotted in Figures 4 and 5. 
Configuration 1 (/?„ = 200,000, ^ = 50,000, L = 50,000) has the 
highest number of detected colluders, whereas configuration 5 
(fi u = 50,000, fi g = 200,000, L = 50,000) has the lowest one. This is 
significant because the values of /?„ and fi„ in configurations 1 and 
5 are inverted, because in the averaging collusion attack, the value 
of P u of each colluder is reduced proportionally to the number of 
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Figure 4. Number of detected colluders after a collusion attack for fingerprints generated with the configurations 1, 2, 3, 4 and 5 in 

Table 3 and P„ , = 1/6. 
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colluders. However, as fi g is the same for each user, the energy of 
P g is first accumulated as many times as the number of colluders, 
and then it is divided by the same value, having no changes. 
Therefore, the amount of energy assigned to users must be the 
highest possible to resist a big amount of colluders. Figure 6 shows 
the detection sequence from a pirate document generated from the 
collusion of two colluders with ID = 300 and ID = 600. The 
fingerprints of both the colluders were defined by configuration 1, 
and despite the reduction in energy, the threshold T u could still 
detect them. Simulations performed for lossless digital documents 
presented a considerable amount of noise under T u . Much of this 
noise was because of the pixels in the host image after fingerprint 
insertion, which were not in the range of 0—255. Therefore, 
negative values were set to 0 and higher values of 255 were set to 
255. This rounding off was reflected as noise. 

Fingerprint Length 

After defining the values of /?„ = 200,000 and ft, = 50,000 as the 
configuration with the highest detection rate, a new simulation of 
collusion attacks was performed to determine the value of L that 
provides the best detection rate. Configurations with different 
values of L, ft, = 200,000, and ft, = 50,000 were selected from 
Table 3, and these configurations were 1,6, 11, 16, 21, 26 and 31. 
The simulation results are plotted in Figures 7 and 8. As the value 
of L increased, the detection ratio also increased. In [25] 
Kuribayashi had already mentioned this behavior for images in 
general, but in that work, it was indicated that the value of L is 
limited by the image size. It is possible to calculate the highest 
value of L for the document samples, as shown in Equation 5, 



where L max is the maximum value of L, D w is the document 
width, Dh is the document height, D com is the number of color 
components (typically 3), and P w is the insertion point of the 
fingerprint. 

Lmax — (D w x Z)/, x D com ) P w (5) 

Using Equation 5, for the sample documents, L max is 
12,150,000. Using L = 350,000, it was possible to detect 270 
colluders in a collusion attack with 300 colluders. Reducing ft and 
ft, to increase L to maintain the perceptual transparency could be 
a good tradeoff. 

Collusion Attack on Digital Documents in JPEG Format 

Once identified that configuration 31 (ft, = 200,000, ft, = 50,00 
and L = 350,000) in Table 3 leads to the highest amount of 
detected colluders, these values were used to generate fingerprints 
and simulate collusion attacks using the lossy image format JPEG. 
In these simulations, the fingerprinted digital documents were 
compressed storing the image in a JPEG format. Then, the 
collusion attack was simulated and the resulting document was 
stored again in JPEG format. This implies that the fingerprints in 
the digital document were affected twofold by the compression 
process. Figure 9 shows the results of the simulated collusion 
attacks from 2 to 250 colluders, over 15 digital documents with 
their quality reduced to 80, 60, and 30%. Comparing the 
maximum amount of colluders detected using documents in JPEG 
format (Figure 9) with those using documents in TIFF format 
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O Configuration 1 
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Number of colluders 
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^ Configuration 2 
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Figure 5. Probability of detected colluders after a collusion attack for fingerprints generated with the configurations 1 , 2, 3, 4 and 5 

in Table 3 and P w = 1/6. 

doi:10.1371/journal.pone.0081976.g005 



(Figure 7), it was observed that the detection rate in documents in 
JPEG format diminished drastically. However, a considerable 
amount of colluders were still detected in digital documents in the 
lossy format JPEG, making this an attractive characteristic for 
organizations. 

Performance Evaluation 

The main functions of the proposed system are the insertion of a 
fingerprint and its detection. The execution time of these functions 
are critical when digital documents are fingerprinted in a 
production environment. To determine if the system holds feasible 
execution times, performance of insertion and detection of 
fingerprints were evaluated over 1 00 digital documents in JPEG 
and TIFF format (implementation details can be found in the 
Software Implementation section). The insertion evaluation 
considers the document read time, the fingerprint insertion time 
and the fingerprinted document write time. On the other hand, 
detection evaluation considers the document read time and the 
fingerprint detection. The results obtained are shown in Table 4. It 
is noticeable that insertion and detection time does not differ 
significantly between the formats. The most time-consuming task 
in the insertion and detection is the execution of the DCT/IDCT, 
with an execution time higher than 3500 ms. This can be observed 
in the relation of execution time between the insertion and 
detection of fingerprints in Table 4; the execution time of the 
insertion function requires two transformations and is twice the 
time of detection in which just one transformation is needed. 
Considerable time is spent in the insertion due to DCT transforms 



applied over digital documents of large dimensions; however, this 
time is still feasible for real-life applications. The time for 
performing the detection of users after the digital document has 
been read and transformed with the IDCT is very low, around 
74 ms, allowing a wide amount of users per group and having an 
acceptable detection time. 

Comparison 

In related works (see the Related Work section), techniques for 
information insertion in digital documents can be divided into line 
shifting encoding, word shifting encoding, and character space 
encoding. As mentioned earlier, these techniques have not 
reported their robustness to collusion attacks. On the other hand, 
schemes described in [29] and [28] could potentially resist 
collusion attacks because they are based on spread spectrum 
techniques. However, these works do not report their robustness to 
these attacks. It is worth mentioning that for none of these 
techniques the impact of fingerprint insertion in perceptual 
transparency has been evaluated. Also, as these insertion 
techniques have not been conceived originally for fingerprinting, 
the supported amount of users is not provided. Table 5 shows a 
comparison of the approach proposed in this work against the 
approaches reviewed in the Related Work section, where insertion 
techniques are used on digital documents represented as images. 
Unlike related works, the selected fingerprinting scheme for the 
proposed system was validated regarding perceptual transparency 
and robustness to collusion attacks through experimentation. 
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Figure 6. Detection of 2 colluders with IDs 300 and 600, with B u = 200,000, B g = 50,000, P„ . = 1/6 and L = 350,000. The red line defines 

the threshold value. 
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Materials and Methods 

This section describes the Fast DCT and Inverse DCT (IDCT) 
Algorithms as well as the fingerprint insertion and detection 
methods. Also, the PSNR and SSIM Index metrics used for 
evaluation of the perceptual transparency in the proposed system 
are described. 

Fast DCT and Inverse DCT Algorithms 

It is known that the Fourier transform of a real-even function 
f( — x)=f (x) is real-even, and i times the Fourier transform of a 
real-odd function /( — x) = —f(x) is real-odd; thus, for these 
symmetry conditions, it is not necessary to use complex inputs/ 
output. Therefore, it is possible to compute the DCT or the 
Discrete Sine Transform (DST) by utilizing an FFT algorithm. 

Let the input vector x(n = 0..M — 1) be even around n= —0.5 
and even around n = M — 0.5; it is possible to show that DFT(x) is 
the nonnormalized DCT of x, Y„ 0 „o(k) described as follows: 



1 kn 

Y non0 (k) = 2 cos [(«+-)— ] 

n = 0 



(6) 



The basis set described in Equation (7) is nonorthogonal; 
therefore, it is necessary to normalize Equation (6) to get the 
orthogonal transform as follows: 



Y 0 (ky- 



{\/^AM)Y non0 (k) 
(\/V2M)Y non0 (k) 



if fc = 0 
otherwise 



(8) 



On the other hand, let the input vector Y(k = 0..M — 1) be 
even around k = 0 and odd around k = M; it is possible to show 
that DFT(F) is the nonnormalized Inverse DCT of Y, x„ 0 „o(ri) 
described as follows: 



M-l 



x mn o(n)= T(0) + 2 T(A:)cos[(n+ -) 



k=\ 



1 . kn. 



2' M' 



(9) 



As shown in Equation (6), a normalization procedure is 
necessary to obtain the orthogonal transform. The normalization 
is carried out as follows: 



with basis: 



b(n,k)=2cos[(n+^)~] 



(7) 



xo(k) -- 



--X„ono(n) 



(10) 



In the literature, fast algorithms for the DFT have been 
extensively reported and very efficient software libraries exist [34] . 
In this work, these libraries are utilized as a module of DCT 
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computing, reducing the effort required for efficient implementa- 
tion to a normalization stage implementation for DCT. 

Fingerprinting Method 

The fingerprinting method for digital images consists in two 
procedures (the fingerprint insertion and detection) as follows: 

Fingerprint Insertion Method. The following steps describe 
the insertion method of the fingerprint W;j described in Equation 
1. This method is graphically shown in Figure 10: 

1 . Transform the image to the frequency domain using the DCT 
function. 

2. Select L coefficients of the low and middle frequencies from a 
position P,,.. The selected coefficients are denoted as: 

F={vB,vi,...,Vi_i} (11) 



Fingerprint Detection Method. The following steps de- 
scribe the detection of the user's fingerprint in an illicit copy of an 
image representing a digital document: 

1 . Transform the illicit copy to the frequency domain using the 
DCT function. 

2. Select L coefficients of the low and middle frequencies from the 
position P,,,. The selected coefficients are denoted as: 

F = {v 0 ,vi,-..,vz.-i} (13) 

3. Detect the group ID: 

(a) Generate PN using the secret key s. 

(b) Use the DCT function to extract the detection sequence d g : 



d g = DCT{PN{s)®(V-V)) (14) 

3. Insert the fingerprint additively in the extracted coefficients: 



V=V+Wij (12) 



4. Transform the image to the spacial domain using the Inverse 
DCT function to get the fingerprinted image. 



(c) Calculate the variance <s 2 g of d g considering the probability 
distribution and determine the threshold T g from a given 
false-positive denoted as Pe g : 

T g = ^2^ g edc-\2Pe g ) (15) 
where erfc 1 stands for the inverse complementary error 
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Figure 8. Probability of detected colluders after a collusion attack for fingerprints generated with LI, = 50,000, B„ = 200,000 and 

-P» = l/6. 

doi:1 0.1 371 /journal.pone.0081 976.g008 



function defined in Equation 16 and is computed as 
shown in Equation 17. 



erfc(x) = 



1 



e ' dt 



(16) 



(17) 



(d) If d g in the input k exceeds the threshold T g , it is determined 
whether k is the group ID. 

4. Detect the user ID: 



(a) Generate PN using the ID k of the detected group. 

(b) Use the DCT function to extract the detection sequence d u : 



d„ = D CT(PN(k) <g> ( V - V)) 



(18) 



T u = ^lerf C - l (2Pe u ) 



(19) 



(d) If d u in the input h exceeds the threshold T u , it is determined 
that h is the user ID. 

This hierarchical detection of fingerprints is illustrated in 
Figure 1 1 . 

Evaluation of Perceptual Transparency 

In this study, two methods for evaluation of the perceptual 
transparency of fingerprinted images were utilized. Those methods 
are described as follows: 

Peak Signal-to-Noise Ratio. Peak Signal-to-Noise Ratio 
(PSNR) measures the similarity of two images. It defines the 
relation between the maximum energy of a signal and the noise 
that affects expressing this difference in decibels [33,35]. Given an 
8-bit grayscale image / and a copy of the altered image g, both of 
size M xN, the PSNR between / and g is defined by: 



PSNR(f,g) = Wlo Sl 



f 255 2 
\MSE(f,g\ 



(20) 



(c) Calculate the variance a„ of d„ in a similar way as in 
Equation 17, considering the probability distribution and 
determine the threshold T u from a given false-positive 
denoted as Pe u : 



MSE(f,g)-- 



1 

(M-N) 



(21) 



i=ij=\ 
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Figure 9. Number of detected colluders after a collusion attack with lossy digital documents for fingerprints generated with 

B g = 50,000, B„ = 200,000, L = 350,000 and P w = 1/6. 

doi:10.1371/journal.pone.0081976.g009 



For Mean Square Error (MSE), the difference between pixels^ 
and gjj is considered as an error that generates image quality loss. 
As MSE tends to zero, the value of PSNR approaches infinity. The 
higher the PSNR (f, g) values, the higher the image quality. 

Structural Similarity Index. Structural Similarity Index 
(SSIM) is a particular implementation of the structural similarity 
philosophy [33], and it is considered correlated with the human 
visual system [36] . For two image signals x and y, comparison of 
three components: luminance, contrast, and structure, is neces- 
sary. These components are relatively independent because object 
structures in images neither depend on illumination nor contrast. 
The luminance is defined by the function in Equation 22, where 
H x is the standard deviation of x. 



Then, the closeness of the contrast of the images is measured as 
shown in Equation 23, where a x is the variance of x. 



c{x,y)-- 



2a x a y + Ci 



(23) 



The structure comparison is defined by Equation 24, where a xy 
is the covariance between x and y. 



s(x,y)-- 



-C, 



a x a v + Ci 



(24) 



l( x — (jfe^jy p2) Finally, the three components are combined to get the overall 

iH x +fJy + C{) similarity measure expressed in Equation 25, where the exponents 

a, P, and y are positive integers that define the importance of each 
component. 

Table 4. Performance evaluation of insertion and detection of fingerprints. 





Format 


Insertion time (read, insert, write) 


Detection time (read, detect) 


Detection time(detect) 


JPEG 


7574.14 ms 


3764.19 ms 


76.53 ms 


TIFF 


7311.45 ms 


3853.06 ms 


73.55 ms 
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Table 5. Comparison between the proposed work and other insertion techniques of fingerprinting. 



Robustness 


Work 


Domain 


Strategy 


Collusion- 
resistant 


Lossy 

compression 


Perceptual 
transparency 


Number of users 


Low et al. [16] 


space 


line shifting encoding 


not reported 


yes 


not reported 


not reported 


1 nw ft n\ T1 Rl 

LLJVV Ct Hi. [_ ' 




linf^ ^nittinn f^nfoH i nri 
Milt: _> 1 1 1 1 LI I iM trl My 


not rf^nortf^H 


yes 


not rf^nortf^ri 
1 1 KJ I 1 \J KJ 1 IcLI 


not rf^nortf^H 


Alattar [20] 


space 


line shifting encoding 


not reported 


yes 


not reported 


not reported 


Kim ef al. [21] 


space 


word shifting encoding 


not reported 


yes 


not reported 


not reported 


Yawai et al. [25] 


space 


word shifting encoding 


not reported 


yes 


not reported 


not reported 


Huang ef al. [22] 


space 


character space encoding 


not reported 


yes 


not reported 


not reported 


Chotikakamthorn [19] 


space 


character space encoding 


not reported 


yes 


not reported 


not reported 


Qadir et al. [28] 


frequency 


spread spectrum 


not reported 


not reported 


not reported 


not reported 


He ef a/.[29] 


frequency 


spread spectrum 


not reported 


yes 


not reported 


not reported 


Proposed 


frequency 


spread spectrum 


yes 


yes 


guarantee 


L 2 
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SSIM(x,y) = [l(x,yT[c(x,yf[ S (x,y)} 7 



(25) 



Software Implementation 

The fingerprinting system was implemented in C++ language 
using the GCC compiler version 4.2.1 with Ubuntu 12.10 as 



operative system, an Intel Core i5 processor at 2.7 GHz and 4 GB 
RAM. To perform the DCT and IDCT Transform, the FFTW 
library version 3.3.3 was used [34]. To read and write TIFF 
images, the libtiff version 3.6.1 was used [37], and for the JPEG 
image, the jpeglib version 8.0 was used [38] . For computing the 
PSNR and SSIM Index value the IQA library version 1.1.2 was 
used[39]. Finally, the source code is available upon request. 



Secret key s 



^ PN Generator 



Mh DCT basis 
function 



^ PN Generator 



/-th DCT basis 
function 



Original digital 
document 


\ 


t 


D( 


;t 




\ 


I 


IDCT 


V 


f 


Fingerprinted 
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Figure 10. Diagram of fingerprint insertion method. 
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Figure 11. Diagram of fingerprint detection method. 

doi:10.1371/journal.pone.0081976.g011 



(K h i) 



Detection of 
user ID 



Conclusions 

In this study, the collusion-resistant fingerprinting method 
proposed by Kuribayashi has been implemented in the context of 
restricted distribution of digital documents. The performance of 
the system was evaluated by defining permissible levels of 
distortion in terms of legibility of fingerprinted documents, and 
appropriated values of parameters have been determined to 
achieve higher colluder detection. Estimation of the required 
computing time for insertion and detection of fingerprints in 
digital documents was also carried out. For the fingerprinting 
scheme, it was shown that the energy assigned to users must be 
higher than that assigned to groups to have a higher colluder 
detection probability. Furthermore, an equation to determine the 
maximum fingerprint's length was proposed. The system could 
fully detect up to 270 of 300 colluders for lossless compressed 



digital documents. The detector performance after lossy compres- 
sion remains competitive for real-work environments. The number 
of users available and the high-quality lossy compression 
robustness make the proposed system suitable for implementation 
in a production environment. 
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